QueryCat: automatic categorization of MEDLINE queries
نویسندگان
چکیده
A searcher's inability to formulate an appropriate query can result in an overwhelming number of retrieved documents. Our approach to this problem is to use information about common types or categories of queries to (1) reformulate the user's initial query and (2) create an informative organization of the retrieved documents from the reformulated query. To achieve these goals, we first must identify which common categories or types of queries are the best abstraction of the user's specific query. In this paper, we describe a system that performs this first step of categorizing the user's query. Our system uses a two-phased approach: a lexical analysis phase, and a semantic analysis phase. An evaluation of our system demonstrates that its query categorization corresponds reasonably well to the query categorizations by medical librarians and physicians.
منابع مشابه
Query Translation by Text Categorization
We report on the development of a cross language information retrieval system, which translates user queries by categorizing these queries into terms listed in a controlled vocabulary. Unlike usual automatic text categorization systems, which rely on dataintensive models induced from large training data, our automatic text categorization tool applies data-independent classifiers: a vector-space...
متن کاملQuery and Document Translation by Automatic Text Categorization: A Simple Approach to Establish a Strong Textual Baseline for ImageCLEFmed 2006
In this paper, we report on the fusion of simple retrieval strategies with thesaural resources in order to perform document and query translation for cross–language retrieval in a collection of medical cases. The collection contains textual and visual contents. In this paper, we focus on the textual contents of the collection, which contains documents in three languages: French, English and Ger...
متن کاملAutomatic Text Categorization and Its Applicationto Text
We develop an automatic text categorization approach and investigate its application to text retrieval. The categorization approach is derived from a combination of a learning paradigm known as instancebased learning and an advanced document retrieval technique known as retrieval feedback. We demonstrate the e ectiveness of our categorization approach using two real-world document collections f...
متن کاملAutomatic Text Categorization and Its Application to Text Retrieval
ÐWe develop an automatic text categorization approach and investigate its application to text retrieval. The categorization approach is derived from a combination of a learning paradigm known as instance-based learning and an advanced document retrieval technique known as retrieval feedback. We demonstrate the effectiveness of our categorization approach using two realworld document collections...
متن کاملUsing Discourse Analysis to Improve Text Categorization in MEDLINE
PROBLEM Automatic keyword assignment has been largely studied in medical informatics in the context of the MEDLINE database, both for helping search in MEDLINE and in order to provide an indicative "gist" of the content of an article. Automatic assignment of Medical Subject Headings (MeSH), which is formally an automatic text categorization task, has been proposed using different methods or com...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Proceedings. AMIA Symposium
دوره شماره
صفحات -
تاریخ انتشار 2000